COUPLING TIMES WITH AMBIGUITIES FOR PARTICLE 
SYSTEMS AND APPLICATIONS TO CONTEXT-DEPENDENT 
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• Abstract. We define a notion of coupling time with ambiguities for interact- 

Q' ing particle systems, and show how this can be used to prove ergodicity and 

to bound the convergence time to equilibrium and the decay of correlations 
at equilibrium. A motivation is to provide simple conditions which ensure 
that perturbed particle systems share some properties of the underlying un- 
perturbed system. We apply these results to context-dependent substitution 
I models recently introduced by molecular biologists as descriptions of DNA 

, evolution processes. These models take into account the influence of the neigh- 

• 1 boring bases on the substitution probabilities at a site of the DNA sequence, 

' as opposed to most usual substitution models which assume that sites evolve 

I independently of each other. 
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1. Introduction and motivations 

This paper is devoted to interacting particle systems on the integer line Z with 
finite state space S, whose dynamics is characterized by a finite list of stochastic 
transition rules. We now give an informal description of the dynamics that we 
consider for these systems, and we postpone a proper mathematical definition to 
section [2l 
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1.1. Construction of interacting particle systems dynamics. We begin with 
some vocabulary. A state s is an element of S, a site x is an element of Z, a 
configuration ^ := (^(a;))a:gz is an element of S'^ . A rule TZ := (c, r) is based on a 
context c and characterized by a rate r. A context is a triple c := (A,^, s), where 
A is a finite subset of Z, ^ is a subset of S^, s is a state, and r is a rate, that is, a 
non-negative real number. 

We say that a configuration ^ and a context c = (A, ^, s), or any rule TZ = (c, r) 
based on c, are compatible at site x if A is empty, or if A is not empty and S,{x + A) 
belongs to £, where £_{x + A) is the element of defined as 

^(x + A) := i^{x + y))yeA. 

The interacting particle system is a Markov process {Xt)t on whose dynamics 
is characterized by a given finite list $H := (7^i)iea of stochastic transition rules, as 
follows: for any time t, if a rule TZi = (q, r^) in £H with q — (Ai, £i, Si) is compatible 
with Xt at site x, then Xt+dtix) — Si with probability r^dt + o(di), independently 
of every other rule in 5H, compatible with Xt at site x or elsewhere. 

A classical way to give a more explicit construction of such particle systems uses the 
so-called graphical representation (see for instance [iT] page 142 for a discussion in 
the context of voter models). This amounts to a stochastic flow based on Poisson 
processes: given a time t and an initial condition ^ in S'^ imposed at time t, the 
Poisson processes determine the state of the particle system at every time greater 
than t. Once again informally, to every site x and rule TZt = (c^, r^) in $H corresponds 
a homogenous Poisson process ^{x, i) on the real line M with rate r^, and the points 
of ^[x, i) are the random times at which the rule TZi is apphed to the state at site 
X. Specifically, for every rule TZi — {ci,ri) in EH with context Ci — {Ai,£i,Si), if 
t belongs to '^(x,i) and if TZi and Xf.~ are compatible at site x, then Xt{x) — s; 
otherwise, Xt{x) — Xt-{x). See section [5] for a proper definition. 

1.2. Coupling times. Within this framework, various notions of coupling times 
can be defined. In this paper, an ordinary coupling time is an almost surely finite 
random variable T with negative values, measurable with respect to the family 
i'i!{x,i))(^x,i)Gi,x3 of Poisson processes, and such that, for every time u < T, if the 
dynamics starts at time u, the state of site a; = at time t = 0^ is the same for 
every initial condition at time u. This definition corresponds to a coupling from 
the past, as opposed to the usual notion of forward coupling. 

As soon as such coupling times exist, the particle system is ergodic. Furthermore, 
estimates on the tail of T yield estimates on the rate of convergence to equilibrium, 
and additional assumptions on the coupling time yield estimates on the decay of 
correlations. Consider now the set of points 

r:= y i^{x,i)n[T,0[) X {x}, 

{x,i) 

where the union runs over every a; in Z and i in 3. A point in T corresponds to a 
transition that may or may not be performed between the times t = T and t = 0~, 
depending on the initial condition at time v < T. When, for a given {u, x) in T, 
there indeed exists v < T and two distinct initial conditions at time v such that, 
for one of these initial conditions, the transition proposed by (m, x) is performed, 
while it is not performed when the other initial condition is used, we say that an 
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ambiguity arises at (u, x). By the definition of an ordinary coupling time, one sees 
that, for each time in T, cither there is no ambiguity associated with it, or there is 
an ambiguity that has no influence on the state of site a; = at time t — 0^ . 

We can now define, once again informally, the notion of coupling time with ambi- 
guities. This is a pair (H,T), where T is a random variable with negative values, 
measurable with respect to the family {^{x, i))(x,i)ezx3 of Poisson processes and H 
is a finite random subset of the set T defined above, enjoying the stopping property, 
and such that the following property holds: for any two initial conditions at time 
u < T such that the ambiguities associated with the elements of H are resolved 
in the same way (that is, a transition corresponding to an element of H is either 
performed for both initial conditions, or not performed for both initial conditions), 
the state of site a; = at time t = 0^ is the same for both initial conditions. 

One sees that, if (77, T) is a coupling time with ambiguities, T may or may not 
be an ordinary coupling time. However, the only ambiguities that may prevent T 
from being an ordinary coupling time are those associated to the points in H. As 
a consequence, in the degenerate case when H is empty, T is indeed an ordinary 
coupling time. 

Informally, our main result is that, if the random set H contains few enough points 
on average (we call subcritical any coupling time with ambiguities enjoying this 
property), it is possible to build an ordinary coupling time from (H, T), thus prov- 
ing ergodicity of the particle system. Moreover, more specific estimates and as- 
sumptions about the set H provide estimates on this ordinary coupling time, that 
are suitable to study the rate of convergence to equilibrium of the particle system 
and the decay of its correlations. 

The construction of an ordinary coupling time from a subcritical coupling time 
with ambiguities is described in section [H The principle of this construction is to 
apply iteratively coupling times with ambiguities, looking further and further into 
the past, until every ambiguity is eventually resolved. 

1.3. Perturbed particle systems. We now describe how these results allow to 
study some perturbed particle systems. We assume that the list of transition rules 
is of the form d\ = {^i)ie3°udp , where 3° and 3^ are disjoint sets, the family W := 
(^j)ie3° being the family of so-called non-perturbative rules, while W :— {^i)ie:ip 
is the family of so-called perturbative rules. 

We call the interacting particle system based on the whole family of rules D\ the 
perturbed system and the system based on the family of non-perturbative rules [H° 
the unperturbed system. 

A general problem about perturbations of particle systems is to relate the properties 
of the perturbed system such as ergodicity, speed of convergence to equilibrium or 
decay of correlations at equilibrium, to those of the unperturbed system, when the 
transition rates attached to the perturbative rules arc small enough. In this context, 
we wish to mention two results, one on the negative side and one on the positive 
side: 

(1) Small perturbations of ergodic particle systems may not be ergodic. For 
a well-known example, consider the two-dimensional Ising model. Its dy- 
namics is ergodic at the critical inverse temperature /3c and not ergodic at 
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any inverse temperature /3 > /9c, see [10] (page 204 and Theorem 2.16 on 
page 195) for instance. 
(2) Small perturbations of particle systems whose coordinates evolve indepen- 
dently are ergodic, see |10j (Theorem 4.1 on page 31) for instance. 

Depending on the assumptions one makes about the unperturbed system, and on 
the perturbations one considers, one can use various methods to deal with this 
problem. For an example of the use of techniques and spectral gap estimates, 
see |9]. For an example of the use of cluster-expansion estimates, see [13]. For 
examples involving Lyapounov function techniques, in the slightly different context 
of perturbations of Markov chains, see [T] f6l ITSl I16j. 

The approach of this paper is based on coupling. The basic idea is that, in some 
situations, it should be possible to rely on the coupling properties of the unper- 
turbed system to devise a coupling time with ambiguities (i? , T) for the perturbed 
system, with the property that, when the rates associated with perturbative rules 
are small enough, the coupling time with ambiguities is subcritical. We do not 
provide an abstract formulation of this idea, but, as an illustration, we give two 
concrete examples in section [S] in the context of stochastic models of nucleotide 
substitution in molecular evolution, recently studied in [3j. 

1.4. Organization of the paper. Section [2] contains a formal definition of the 
interacting particle systems studied in this paper, their construction by means of 
Poisson processes, the definition of the notion of coupling time with ambiguities, 
and some notations. Section [3] contains the main results, whose proofs are in 
section 2] Section [5] applies these theoretical results to a concrete case, namely 
a class of stochastic models of context-dependent nucleotide substitution, recently 
introduced by molecular biologists, and whose study was our initial motivation for 
the results in this paper. 

2. Formal setting 

2.1. Preliminary definitions and notations. In this paper, particle systems 
are continuous-time Markov processes on S"^, where S denotes a finite set. Sites 
X are elements of Z, states s are elements of S and configurations ^ = {S,{^))xe'i. 
are elements of S'^. The space C := C([0, -f oo[, S'^) is the space of cadlag functions 
on [0, -|-oo[ with values in S^. For every nonnegative time i, : C — > 5^ is the 
canonical coordinate map on C, hence Xt{Lo) Lo{t) for every uj in C. The space C 
is endowed with the cylindrical cr-algebra (T((Xt)t^o)- For every nonempty subset 
B of Z, ttb ■ ^ is the canonical projection defined by 

7rs(0 {£.{x))xeB- 

For every site x, := n^^}- For every site y, dy : ^ is the canonical 
translation of defined by 

For every configuration ^, site x and state s, the configuration ^^'^ is defined by 
^^■^(x) := s, and S.'^-^iy) := ^(y) for every site y ^ x. 
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Finally, M+ := [0, +oo[, M_ :=] - cx3,0], and C(5^) is the space of the functions F 
defined on S*^ such that the following series converges: 

5]sup{li^(r^)-i^(01;Ce5^se5}. 

2.2. Specifications by transition rules. Recall that one is given a finite list of 
transition rules 

indexed by a finite set 3. For every i in 3, the rate of the rule TZi is a nonnegative 
real number and its context Ci = (A^, ti, Si) is characterized by a finite subset Ai of 
Z, a subset £i of S^^ and a state Si in S*. 

The list of rules yields a definition of the dynamics of the process through its 
infinitesimal generator £, as follows: for every function F in C{S'^) and every 
configuration ^, 

{x,i) 

where the sum enumerates every site x in Z and rule index i in 3. Here and below, 
we adopt the convention that l{7r0(^) G 0} = 1 for every ^ in S^. 

The fact that the generator £ indeed defines a Feller Markov process (^t)t>o on 
is a standard result, see [TD] for instance. For every configuration ^, denotes 
the probability measure on C which corresponds to the initial condition Xq = ^. 

Distinct families of stochastic transition rules may lead to the same infinitesimal 
generator £. As a consequence, the probability measures do not uniquely de- 
termine ^R in general, and several families of rules are compatible with the same 
Markov process. However, distinct families of rules do lead to distinct versions of 
the construction presented in the next section, so the coupling properties of this 
construction that are of essential use in this paper, may differ substantially from 
one family of rules to another, even when the corresponding infinitesimal generators 
are the same. 

2.3. Dynamics based on Poisson processes. The infinitesimal generator £ de- 
fined above is the usual way to specify a dynamics from a finite collection of tran- 
sition rules. However, the coupling times that we consider in this paper are formu- 
lated in terms of another construction, the so-called graphical construction of the 
dynamics, see [lOj , which is based on a family of Poisson processes corresponding 
to transition times. 

We now describe this construction in details. Since we will be interested in coupling 
from the past, we only have to define the dynamics of the process for the "past" 
times t ^ 0. 

2.3.1. Poisson processes. The individual sample space for the Poisson processes is 
the set 

^0 {{tk)k^i C M ; Vfc ^ f , tk+i < tk < 0, lim i„ = --oo}. 

n— i-+oo 
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We equip fio with the cr-algebra induced by the product Borel cr-algebra on the 
space of real valued sequences. For every fc > 1, the coordinate map Tfe : fio — > K 
is defined by 

T)s((t„)„^i) := ffc. 

For every site x in Z and rule index i in 3, is the probability measure on (Oq) -^^o) 
such that the sequence {Tk)k^i is a Poisson process on IR_ with rate rj. 

To define the dynamics of interest, we introduce a family of processes on the sample 
probability space 

(fi,:f,P) :=(g)(no,^o,Pt), 

{x.i) 

where the ® product enumerates every site a; in Z and every rule index i in 3. For 
every a; in Z, i in J, A; > 1, and V in one defines 

*(a;,z,A;)(V) := Tk{il){x,i)), *(a;,i)W ■=il^{x,i), 

and 

With these notations, '^{x,i) = {^{x,i,k))k^\. In the sequel, \l/(a;,i) denotes also 
the random set {^'(x, i, fc)} and the context should make clear which one of 

these two notations is in use. The same convention applies to ^{x) and Finally, 
T{x, i) is the sub-cr-algebra of T generated by 'i{x,i). 

Remark 2.1. In our context, it is necessary to use an indexation of the various 

random variables ^ and of related quantities by rule indices i in 3 instead of an 
indexation by the rules Ti in'OX themselves. To see why, consider the case when two 
rules TZi and TZj with j in 3 are described by the same contexts q = Cj and the 
same rates r-i = . Then TZi = TZj i'ut we want to consider the addition of their 
two effects, which could also be described by the single rule with context Ci and rate 
2ri. 

2.3.2. Flows. Let fii C denote the event that 'i'{x,i,k) ^ '^{x',i',k') for every 
{x, i, k) ^ [x' , i', k'). Then P(r2i) = 1, that is, almost surely, for each time t ^0, t 
belongs to exactly one set \E'(a;, i, k) or t belongs to none of them. 

Definition 2.2 (Direct influences). The direct influence process is the random 
process Influ defined on R_ x Z as follows. Let t ^ and x in Z. // 1 belongs to a 
unique set ^{x,i,k), let 

Influ(t) := {t} X {x + Ai). 
Otherwise, let Influ(t) := {(t,x)}. Conversely, for every site x, let 

Influ" ^(t,x) := t. 

When Ai is empty, this definition implies that Influ (t) is empty. 

Definition 2.3 (Preceding times). For every times u < t Q and. site x, the 
preceding time Precede (t, x) at x after u and before t is the random variable with 
values in M_ x Z defined by 

Preced„(t,a;) := (sup]w, t[n'S'(a;),a;) , 

with the convention that Preced„(f, x) := {u,x) if]u,t[n^{x) is empty. 
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Definition 2.4 (Multilevel influences). For every site x and times u < t ^ 0, we 
define inductively a sequence {ln&nk{u,t,x))k^o of random sets, called the influ- 
ences of site X after u and before t at level k, as follows. 

• For k = 0, let Influo(M, t, x) := Influ(t, x); 

• For every fc ^ 0, 

Influfe+i(ii, t, a;) :— Influfc(u, t, x) U Influ(Preced„(Influfe(u, t, x))). 
Additionally, the complete influence of site x after u and before t is 

-f oo 

Influoo(M, a;) := [J lTiQMk{u,t,x). 

k=0 

Let C r2 denote the event that lu&nQo{u,t,x) is a finite set for every site x and 
couple {u,t) of times such that u < t ^0. Then P(il2) — 1 (see [TT] page 142, for 
instance) . 

We define a random flow $ on 5^ x M x M x Z, such that, for every u ^ t ^ 0, 
<i>(^, u, t, x) is the x-coordinate of the configuration at time t which one obtains by 
applying the moves described by 5* to the configuration ^ at time u. 

The definition of $ is based on the following recursive procedure. 

Assume first that ili n ^2 holds. Fix a site x and times u ^ t ^ 0. If w = t, let 
^{^,u,t,x) := (,{x). li u < t, consider first the case where t is not in ^'(a;). Then 
Influ(<, x) — {{t,x)}, and we use a recursive call to the definition of <i> by letting 

<i>{C,u,t,x) := $(C, Precede (t, a;)). 

Otherwise, t = ^{x,i,k) for exactly one rule index i in 3 and one index fc ^ 1. 
Consider then the set Precede (Influ(t, a;)). If this set reduces to {{u,x)}, then, for 
any (i/,t) in Influ(t,a:), no rule applies at site y between the times u and t. Then, 
let x{y) '■= C(y)- Otherwise, Precede (Influ(t, a;)) is not reduced to {{u,x)}. For 
every element (y, t) of Influ(t, a;), we use a recursive call to the definition of $ and 
let 

Xiy) ■= 'i>{tu,PTecedu{t,y)). 
Then, if x{x + Ai) belongs to £{ (remember that this is automatically the case when 
Ai is empty), let $(^,u,i,a;) := Si, and say that ^'(a;,i,fc) is performed when one 
starts from configuration ^ at time u. Otherwise, let ^{S,,u,t, x) :— ^(x) and say 
that ^'(a;, i, k) is not performed when one starts from configuration ^ at time u. For 
the sake of definiteness, if ili fl does not hold, let (f>(^, u, t, a;) := £,{x), and say 
that ^!{x,i,k) is not performed, whatever the value of {x,i,k) is. This ends the 
description of the construction of <&. 

The fact that, on fii n O2, Influoo(it, a:) is a finite set, guarantees that the above 
procedure involves only a finite number of recursive calls to the definition of $ and 
leads to a consistent definition of <&. Moreover, one can check that the fact that 
"^{x, i, k) is performed or not does not depend on the value of t, but only on ^, u, 
and, of course, VP. The proof of the proposition below is standard. 

Proposition 2.5 (Flow properties). The flow $ enjoys the following properties. 

• For every times u < v < t ^ and site x, 

$($(C, U, V, •), V, t, x) = $(C, u, t, x). 
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• For every time u < 0, the distribution of the cddlag random process 

is the distribution of {Xt)o^t^-u with respect to Q^. 

A motivation to give the details of the construction of $ was to be able to define 
the following random variable. 

Definition 2.6 (Performance indicator). The performance indicator of rank k ^ 1 
for the rule index i at site x, starting from configuration ^ at time u, is 

Perf u, X, i, k) := l{^'(a;, i, k) is performed when starting from (, at time u}. 

2.3.3. Measurability and shifts. For every time t ^ and rule index i in 3, let 
Kt{x,i) denote the random variable on P) defined by 

Kt{x, i) := max{k > 1 ; ^{x, i, k)^t}LI {0}. 

For every time t < and site y, the space-time-shift ai,y is defined on O by 

[(Tt.y{'^)]ix,i,k) := ^ {x + y,i,k + Kt{x + y,i)) - t. 
Then P is invariant with respect to every <Jt^y and 

IninOg ^ lninn2 ° ^t,v 

The behavior of the flow under the action of the shift is described by our next 
lemma, whose proof is left to the reader. 

Lemma 2.7. On fii fl O2, for every sites x and y and times u < u < and f < 0, 

$(^, u, V, x) o (Tt,y = $(^, u + t,v + t,x + y), 
and, for every rule index i inJ and index k^ 1, 

Perf(^, u, x,i,k — Kt{x, i)) o ut^y = Perf u + t,x + y,i,k). 

Let r := {n} x M" and J"^ the cr-algebra on T generated by the sets {n} x B, 
for every n ^ and every Borel subset B of M". 

Definition 2.8. For every time t ^ 0, let T^{t) denote the sub-a-algebra of 
generated by the family of maps 6f{x,i) : Q — > F, for every site x in Z and rule 
index i in 3, defined by 

e+{x,i) := {Kt{x,i); {^{x,i,k); 1 ^ k ^ Kt{x,i)}) . 

More generally, if U is a random variable defined on with values in IR_, 

J^~^{U) denotes the sub-a-algebra of T generated by the maps 9^{x,i). 

One can view J^~^ {t) as the cr-algebra of the events posterior to the time t. 
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2.4. Coupling times with ambiguities. 

Definition 2.9. Let H — {H{x,i,k) ; x G Z, i ^ 3, k ^ 1} denote a family of 
random variables, defined on {Q,J-^¥) and with values in {0,1}. Then VtiJ is the 
subset o/M X Z defined as 

:= {(^(x, I, /c), a;) ; X G Z, i e 3, fc ^ 1, H{x, i, k) = 1}. 

Likewise, 

Perf u, H) := {Perf u, x, i, k)H{x, i,k) ; x e Z, i e 3, k ^ 1} . 
Let JC denote the product a-algebra on {0, i|^x3x{i!2,...}^ 

Definition 2.10 (Coupling time with ambiguities). The pair {H,T) is a coupling 
time with ambiguities if H = {H(x,i,k) ; x Z, i £ 3, k ^ 1} is a {0,l}-valued 
process and T is a random variable defined on {fl,J-,V), such that the following 
holds. 

(1) The random variable T belongs to ] — oo,0[, P almost surely. 

(2) At most a finite number of the random variables H{x,i,k) are not zero, P 
almost surely. 

(3) For every site x, rule index i and index k ^ 1, if ^(a;,i,fc) < T, then 
H{x,i,k) = 0, P almost surely. 

(4) For every site x, rule index i and index k ^ 1, H{x, i, k) is measurable with 
respect to J-~^{^{x,i,k)). 

(5) For every time t < T and configurations ^ and if PeTf{^,t, H) and 
Perf{^',t,H) are equal, then $(^,t, 0~,0) and (f>(^', i, 0~, 0) are equal, P 
almost surely. 

Remark 2.11. A consequence of definition \2.10\ and of the flow property of $ 
described by proposition \2.5\ is that, if {H, T) is a coupling time with ambiguities, 
there exists a map O : f2 x {0, i|^x3^x{i'2,...} _^ g such that, P almost surely, for 
every t <T , 

4>(C,i,0-,0) = e(vI/,Perf(^,t,i7)). 
Note that Q does not depend on t. 

Definition 2.12 (Width of couphng times with ambiguities). The width of a cou- 
pling time with ambiguities {H,T) is bounded by the couple (a_,a+) of nonnegative 
integers if the following holds. 

(1) The random process H — {H{x,i,k) ] x E Z, i E 3, k ^ 1} is measurable 
with respect to a(J-{x, i) ; — a_ ^ a; ^ i E 3). 

(2) Influ(\E'i?) C [-a_,a+] x R_, P almost surely. 

(3) The map Q in remark \2.11\ can be chosen to be measurable with respect to 
the a-algebra a {T(x, i) ; — a_ ^ x ^ a+, i G 3) ^ IC. 

Definition 2.13 (Growth parameter). The growth parameter of a coupling time 
with ambiguities {H, T) is 

m{H,T) :=E[#biflu(*iJ)]. 
Ifm{H,T) < 1, we say that the coupling time with ambiguities {H,T) is subcritical. 



10 



JEAN BERARD, DIDIER PIAU 



Note that 

#Influ(vI/i7) = J2 {#A,)H{x,i,k). 

Definition 2.14 (Laplace transforms of coupling time with ambiguities). For every 
real number X, introduce A7^(A) :— K[e~'^'^] and 

Aff (A) := E I ^ e-^'l{it, y) e Influ(*F)} . 

\{t,y) ) 

Recall that T < almost surely and that t < for every (t, y) in Influ(^'iJ). Note 
that At(0) = 1, Kh{0) = m{H,T), and 

^e-^*l{(t,2/)eInflu(«'iJ)}= Yl m,)o-^''^''^'''^H{x,i,k). 



3. Statement of the main results 
We are now able to state the main results of this paper. 

Theorem A (Ergodicity). // there exists a subcritical coupling time with ambi- 
guities, the particle system is ergodic. That is, there exists a unique invariant 
probability distribution fi, and, for every initial configuration converges in 

distribution to fi as t goes to infinity. 

Theorems IB] and [P] below provide non-asymptotic results. Theorem [Cl is a conse- 
quence of theorem iBl 

Theorem B (Explicit bound). Assume that there exists a subcritical coupling time 
with ambiguities (H, T) and let fi denote the unique invariant probability distribu- 
tion of the particle system. For every configuration ^, finite subset of sites B <Z "L, 
and time t ^ 0, the distance in total variation between the distributions 7rB(A"|) 
and 7rB(/i) is at most 

#S X inf f m{H,TY + inf AH(A)'=AT(A)e-^* ) . 

Theorem C (Exponential rate of convergence). Assume that there exists a sub- 
critical coupling time with ambiguities {H, T) with finite width such that T is ex- 
ponentially integrable. Then, with respect to the total variation distance, for every 
initial configuration , the finite marginals of converge exponentially fast to the 
finite marginals of the invariant distribution. 

Theorem D (Decay of correlations). Assume that there exists a subcritical coupling 
time with ambiguities with growth parameter m < 1 and finite width bounded by 
(a_,a_|_). Let fi denote the unique invariant probability distribution of the particle 
system. For every real number z, let k{z) :— |z|/(a+ -I- a_). 

For every sites x and y in Z, the distance in total variation between TT^^ yjlp) and 
T^xifJ-) "8) T^yip) = T^oip) 8) 7ro(/i) is at most 2m'^^^^'^^^"^. 
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Let X denote a positive integer, B := Jj C\ [x, +cxd) and C := C] (— oo,0]. The 
distance in total variation between ttbucIa*) cind TrsifJ-) ® T^cif) is at most 

( \ + \ ^ 

As a consequence, the same hound applies to every subsets B and C ofZ such that 
min B ^ X + max C . 

4. Proof of the main results 

In section im we define a crucial tool for our proofs, namely the notion of ambiguity 
processes. In section we explain how to control these. This enables us to prove 
theorem 13 in section 231 theorem [B] in section and theorem [D] in section 
Section is a preparation to the proof of theorem [Cl given in section H771 Finally, 
section settles some measurability issues. 

4.1. Ambiguity processes. For every sites x and y, time t ^ 0, rule index i inJ 
and index fc ^ 1, let 

H{t,x){y,i,k) := [H o at,x){y ~ x,i,k - Kt{y,i)). 

For every site x and time t ^ 0, let 

H{x,t) {H{x,t){y,i,k) ; y e Z, i £ 3, k ^ 1}. 

Similarly, let 

T(x,t) := {Toat,,)+t. 
In words, the pair {H(t,x),T(t,x)) corresponds to the translation of the coupling 
time with ambiguities {H,T) from site and time 0~ to site x and time t^ . 

Definition 4.1. For every site x and time t ^0, let A{t,x) denote the event that, 
for every time u < T(x, t) and configurations ^ and such that Perf u, H{t, x)) 
and Peif^^' ,u, H{t, x)) coincide, ^{^,u,t~ ,x) and ,u,t~ , x) coincide. 

Let be the event 

173:= ^(0,0) n fl A{-^{y,t,k),x). 

Lemma 4.2. (1) For every site x and time t ^Q, ¥[A{t,x)] = 1. 
(2) For every sites x and y, rule index i and index k ^ 1, 

F[A{^iy,i,k),x)] = 1. 

Proof of lemma \4^ Part (1) is a simple consequence of lemma l2.7l in section [2.3.31 
and of the fact that P is invariant under the action of at^x- We omit the details of 
the proof. 

As regards part (2), by our definition 12.101 above, there exists a set B in T such 
that B C A{0,0) and P[B] = 1. With our definitions, for every x and t, 

fii n n {V' e ; at^xW e B} c A{t, x). 

Hence, we only need to prove that P[aT,x{'^) G S] = 1, where r := '^{y, i, k). 

By standard properties of Poisson processes, ar,x{^) is independent from r and has 
the same distribution as ^. As a consequence, letting T denote the distribution of r. 
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the distribution of (r, o'r,x(^')) on (—00, 0) x il equipped with the product cr-algebra 
B{{~oo, 0)) (g) J-, is equal to the product measure T (g) P. By Fubini theorem, 

P[cr^,r(«') eB]= f P[* e B] dT(t) = 1. 
This concludes the proof of lemma □ 



A direct consequence of lemma IT2l above is the following proposition. 
Proposition 4.3. Vp^] = 1. 

Definition 4.4 (Ambiguity processes). The ambiguity process at site x is a se- 
quence (Amb„(a;))„^o of random subsets of Z x M_ , defined recursively as follows. 

• Initialization: Ambo(a;) := {(0,a;)}. 

• Induction: Amb„+i(a;) :— |^ Influ(^'i7(t, y)). 

(t,y)eAmh„(x) 

This defines a nondecreasing random sequence of sets (Amb„(a;))„^o • Let 

Amb(a;) :— (J Amb„(a;). 

By construction, Amb(a;) is a finite set if and only Amb„(a;) — Amb„+i(a;) for some 
index n 5^ 0. For such an index n, Ambfc(x) — Amb„(x) for every k ^ n, whence 
Amb„(a;) = Amb(x). 

We wish to prove that for subcritical coupling times with ambiguities, the set 
Amb(a;) is almost surely finite. 

Definition 4.5 (Coupling time at a site). The coupling time T* at site x is 
T;:=inf{T(t,y);(i,y)eAmb(x)}. 

Observe that, if Amb„(a;) — Amb„+i(a;), then 

T; :=inf{T(i,y); (t,y) e Amb„(a;)}. 

Definition 4.6 (Influence in the ambiguity process). Let (t,y) and {u, z) denote 
elements of Amh{x). Say that {u, z) is influenced by (t,y) if{u,z) belongs to the 
set Influ(*H(t, y)). 

Note that an element of Amb(x) may be influenced by several elements of Amb(a;). 

We define inductively a sequence {Zn{x))n^o of random subsets of Amb(a;) as fol- 
lows. For n — 0, let Zq{x) := {{0,x)}. For every n > 0, Z„+i(a:) denotes the set 
(possibly empty) of the elements influenced by elements of Zn{x). Hence, 



Amb(a;) = |J Z„(x). 



n>0 



Moreover, if Zn{x) is empty for a given n ^ 1, then Zk{x) is empty for every k ^ n 
as well, and in that case, 



Amb(a;) = |J Zk{x). 
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Definition 4.7 (Locked sites). For every site x and times u < t ^ 0, say that 
(t^ , x) is locked by time u if for every time v < u and configurations ^ and ^' , 

Lemma 4.8. On fli n fl2, */ locked by time u < t, then for every times 

V < u and v' < u, and configurations ^ and , 

The proof of lemma [4?8l is a consequence of the flow property of $ in proposition [2l5] 
and we omit it. 

We now give a definition concerning ambiguities. 

Definition 4.9 (Resolution of ambiguities). One says that the ambiguity associated 
to (^'(x, i, k), x) is resolved by time u ifu < ^'(x, i, k) and if, for every configurations 
^ and £f , 

Pert (^, u, a;, i, k) ~ Pert (^', u, x, i, k). 

Proposition 4.10. //Amb(a;) is finite, then {0~,x) is almost surely locked by time 
t, for every t < T* . 

Proof of proposition \JJTJ[ Assume throughout the proof that fii n H holds, 
since this event has probability one. 

Consider an index n such that Amb2;(n) = Amba;(n + 1), and let {tk,Xk)i^k^r 
denote an enumeration of the set {Amhx{n))r\\ — oo, 0] such that 

ti <t2 < ■ ■ ■ <tr = 0. 

We wish to prove by induction that, for every 1 ^ q ^ r, with full probability, the 
following property (Pg) holds: 

(Pq) For every 1 ^ j < and every {tj,y) in Influ(tj, x^), {tj,y) 
is locked by time T*. 

Assume first that q — l and consider {ti,z) in Influ(i;i). 

By definition, Influ(\E'-ff (ti, 2)) C Amhx{n + 1) since {ti,z) belongs to Amb2,(n). 
But every element {w,u) in Influ(\l/ff(<i, z)) is such that u < ti, by definition 
of H and Influ. On the other hand, u > ii by the definition of ti. This is a 
contradiction, hence Influ(\l/i?(ti, z)) is empty. Using the definition of a couphng 
time with ambiguities and the fact that, by definition, T* ^ T{ti,z), we deduce 
that {t^ ,z) is, with full probability, locked by time T*. This proves {Pi). 

Assume now that (Pq) hold for some 1 ^ g ^ r — 1, and consider an element (ig+i, z) 
of lnflu(tg+i, Xg+i). Observe that iJ(ig+i,z) C {{ti, xi), . . . , {tg, Xq)}. As a con- 
sequence, according to (Pq), with full probability, the ambiguities associated with 
the elements of H{tq^i, z) are resolved by time T*. Thus, for every configurations 
^ and (f and time t < T*, 

Perf (t, e, H{tq+i,z)) = Pcrf (t, i7(tg+i, z)). 

Using the fact that by definition T* ^ r(tg_|_i, z), and the definition of a coupling 
time with ambiguities, one sees that {t'^^-^^,z) is locked by time T*. Hence the 
ambiguity associated with (ig+i,Xg+i) is resolved by time T*, and (Pq+i) holds. 

The proof of proposition 14. 101 is complete. □ 
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Remark 4.11. The reader might have noticed that property in definition \2.10i 
in section \2.4\ is not used in the proof of proposition \4-10\ above. However, this 
property plays a crucial role in the estimates on the ambiguity process presented in 
the next section. 

4.2. Controlling ambiguity processes. The goal of this section is to prove the 
preliminary estimates of lemma 14.121 below. For every A and nonnegative integer 
n, let 

C{n,X):= Yl e"^'' 
{t.y}ez„{o) 

and 

Din,X):= J2 e-^^(*^«). 
(t.y)ez„(o) 

Lemma 4.12. For every A and nonnegative integer n, 

E[C(n,A)] < Ah(A)", 

and 

E[D(n, A)] = AT(A)E(C(n, A)) At(A)Ah(A)". 

Proof of lemma \4.12\ The proof of the first assertion is by induction on n. For 
n — 0, C(0, A) = 1 hence the result is obvious. Assume that the result holds for 
a given n ^ 0. Every element in Zn+i{x) is influenced by at least one element in 
Zn{x), hence C{n + 1, A) ^ Co{n + FA), with 

Co(n + l,A):= ^ ^ e-^"l{(ii, z) G Influ(*i/(i, y))}. 

(t,iy)eZ„(0) (,u,z) 

Hence, 

Co(n+l,A)- ^ e-^*^e-^("-*)l{(M,z)eInflu(*i7(t, ?/))}. 

Let 

Ci{n, A, X, y, i, k) l{(y, i, k)) £ Z„(0)} e-^*(-'^''=), 

and 

C2(A, X, y, I, k) e-^("-*(-'*''=)) l{(u, z) G Influ(vI/iJ(vI/(x, A:), y))}. 

(ti,z) 

The last expression of Co(n + 1, A) can be rewritten as 
Co(n+l,A)= Y E 

{x,i,k) y^x+Ai 

Taking expectations on both sides, 

E[C7o(n + l,A)] = nCi{n,\x,y,i,k)C2{\x,y,i,k)]. 

(x,i,k) y£x+Ai 

According to lemmas [4.161 and 14. 171 in section below, every Ci{n, X, x,y,i, k) is 
measurable with respect to J-'~^{'i/{x, i, fc)), while the conditional distribution of ev- 
ery C2(A, X, y, i, k) with respect to JF+(^'(a:, i, k)) is the same as the (unconditional) 
distribution of 

Y e~^"l{(u,z) e Influ(«'iJ)}. 

{u,z) 
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As a consequence, 

E [Ci(n, A, X, y, i, A;)C2(A, x, y, i, /c)] = A//(A) E[Ci(n, A, x, y, i, A;)]. 
In turn, this implies that 

E[Co(n + l,A)]=AH(A)Ej ^ J2 Ci{n, X,x,y,i,k)\ . 

It remains to notice that 

^ ^ Ci{n,X,x,y,i,k) ^ C{n,X), 

(x,i,k) y£x+Ai 

to see that the induction on n is complete. The proof of the first assertion of 
lemma l4.12l is complete. 

As regards the second assertion, fix an integer n ^ 0, and define 
C3(A, x, y, I, k) e~^iTmx,^.k))^■i'{x,^.k).y) ^ 

Using the functional Ci defined in the proof of the first assertion, one sees that 

D{n,X)^ ^ ^ Ci{n,X,x,y,i,k)C3{X,x,y,i,k). 

(x,i,k) yEx+Ai 

As a consequence, 

E[Din,X)]= J2 J2 nCiin,X,x,y,i,k)C3iX,x,y,i,k)]. 

{x,i,k) y£x+Ai 

As in the proof of the first assertion, we observe that, for every fixed {x,i,k,y), 
the random variable Ci(n, A, x, y, i, k) is measurable with respect to J^^{^{x, i, k)), 
while the conditional distribution of T{y,^{x,i,k)) — \E'(a:,z,fc) with respect to 
J-^{'^{x, i, k)), is the same as the (unconditional) distribution of T. Using the last 
displayed identity in the proof of the first assertion once again, the result follows. 

The proof of lemma H.12l is complete. □ 

4.3. End of the proof of theorem [Aj Lemma 14.121 with A = shows that, 
for every n ^ and every site x, E[#Z„(x)] ^ A//(0)" = m(if, T)". Markov 
inequality yields P[Zn{x) 0] ^ m{H,T)". In particular, there exists P almost 
surely an integer n such that Zn{x) is empty. For such an integer n, 

Amb(x) - U Zk{x), 

O^fe^n-l 

hence Amb(a;) is finite with full probability. 

Fix a finite subset i? C Z, and let := minjT* ; x G B}. By proposition 14. 101 in 
section ICT for every time t < T* and configurations ^ and 

The probability that time belongs to "if is 0, hence, almost surely, for every time 
t <T* and configurations ^ and 

7TBmc,t,o,x)) = 7:Bmc,t,o,x)). 

As a consequence, for every positive time t and configurations ^ and the distance 
in total variation between ttb{X^) and nsiX^ ) is at most P[T^ < —t]. Since every 
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min T(u, x) ^ t 

(u,x)G2fc(0) 



T* is almost surely finite and B is finite, Tg is almost surely finite, hence this 
distance goes to when t goes to infinity. 

Now, a generic compactness argument shows that, as a consequence of S being 
finite, there exists at least one invariant distribution fi for the particle system (see 
for instance [lOj chapter 1, proposition 1.8). Consider now a random configuration 
^' with distribution /x, and an arbitrary configuration ^. For every t ^ 0, the 
distribution of is ^, and the previous estimates then show that, as t goes to 
infinity, converges in distribution to /i as i goes to infinity. This ends the proof 
of theorem [XI 

4.4. End of the proof of theorem |B1 By the union bound, for every negative t, 

V[Tb s^t]s^Y^ V[T* s^t] = {#B) P[To < t]. 

xeB 

Fix an index n ^ 1. On the event that Z„(0) is empty, 

Tq* = min{r(-it, x); k i^n- 1, {u,x) e Zk(0)}. 
As a consequence, by the union bound, 

n-l 

P[ro* <t] <m(i/,T)" + ^I 
Fix A ^ 0. Then 

exp(— A min T(u, x)) ^ D(k. X). 

Hence, Markov inequality yields 

min T(u, x) i < e^*E\D(k, A)] < e^* Ah(X)'' AtM, 

where lemma |4. 1 21 provides the last inequality. Taking the infimum with respect to 
A ^ in this inequality, separately for each ^ /c ^ n — 1, yields the conclusion of 
theorem [BJ 

4.5. Proof of theorem [Cl By hypothesis. Ay (A) is finite for some positive values 
of A and the width of {H, T) is finite, hence our next lemma implies the theorem. 

Lemma 4.13. Assume that the width of {H,T) is finite and that A7^(A) is finite 
for a positive A. Then Ah{X') is finite for every A' < A. 

Proof of lemma \4-13\ Replacing every H{x,i,k) such that — a_ ^ a; ^ and 
^(x, i, fc) ^ T by 1 in the expectation which defines Ajj(A') yields 

—a-^x^a^ i 

with 

Ajf(a;,z,A') := e-^''l{t>T}. 

Fix a positive real number A". Then, e~^'^l{t > T} ^ q-\"t+{\" ~\')t ^ hence, for 
every site x and rule index i, 

te*(a;,i) 



COUPLING TIMES WITH AMBIGUITIES FOR PARTICLE SYSTEMS 



17 



Fix p > 1 and q > I such that l/p+ 1/q ~ 1. By Minkowski inequaUty, 

k^l 

Since each —'^{x,i,k) is the sum of k i.i.d. exponential random variables, the last 
sum is the sum of a geometric series, with ratio E[e''('^ ~^ )'^{x,^,l)'^'^/q ^ Assume that 
A" > A'. The ratio is less than 1, hence the sum over k ^ 1 converges. 

Summing this over every site — a_ ^ x ^ a+ and rule index i yields a finite upper 
bound of Afl-(A') as soon as one can find A" > A' and (p, q) such that l/p+l/g = 1 
and E[e~^'*' is finite. If At(A) = E[e~''*"^] is finite, this is possible for every A' < A, 
hence the proof of lemma 14.131 is complete. □ 

4.6. Preparation to the proof of theorem [Dl 

Definition 4.14. For every site x, let N*{x) denote the, almost surely finite, 
smallest integer n such that Zn{x) is empty, and let Qx denote the a-algebra 

Qx (j{T{y, i); x + a_N*{x) ^x + a+N*{x), i e 3). 

Lemma 4.15. For every site x, there exists a random variable Fx, with values in 
S, which is measurable with respect to Qx and such that, P almost surely and for 
every configuration ^, 

lim (^{^,t,{),x)^Fx. 

t — > — oo 

Proof of lemma \4-15\ Assume throughout the proof that fii n H holds, since 
this event has probability one. Let (t^, a;fc)i^fc^r denote an enumeration of the set 
Influ"\Amb2,(iV*(a;)))n] - oo, 0] such that ti < t2 < ■ ■ ■ < tr = 0. 

For every 1 ^ m ^ r — 1, define {x„i, z,„, fc„j) by the relation 

tm — • ^{-^ni: im: ^m)- 

We define by induction a sequence (Wm)o<i^r-i such that, for every m, 

Wm e {0,lf X3X{1.2...}. 

First, let WQ{x,i,k) := for every x, i and k. Now, let 1 ^ m ^ r — 1. For every 
g ^ m - 1, let Wjnixq,iq,kq) := Wm-i{xq,iq,kq). For g = m, let 

m; ^m) ; y ^ ^~ ^m] ^ ^m} ■ 

Finally, let Wm{x,i,k) :— when {x,i,k) is not one of the triples {xj,ij,kj) for 
1 j s^m. 

As in the proof of proposition 14.101 we have that, for every g ^ r — 1, P almost 
surely, for every time t < T* and for every {tq+i,y) in Influ(tq_|_i, Xq+i), "^H{tq+i, y) 
is a subset of {(ii, xi), . . . , {tq, Xq)}, and that 

t, t-^„y) = e(Perf (e, t, H{tq+^,y)), Ot^^.J;^)). 

Hence Peif{£^, t, H{tq,y)) — Wq-iH{tq,y) for every 1 ^ q ^ r and P almost surely. 
Let 

Fx :=e(W,„iH(x,0),ao,.(*)). 
One sees that, P almost surely and for every time t < T*, 

$(C,i,0,x) = $(C,t,0-,x)=F,. 
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Finally, the measurability properties of Fx follow from assumption ([3|) in defini- 
tion [2TT21 The proof of lemma 14.151 is complete. □ 

4.7. Proof of theorem |Dj Assmiie that minS = n + maxC with rt ^ 1, and 
introduce the real number 

z :— maxC + n — — = maxC + K(n)a_|-. 

a+ + a_ 

Let and denote two families of Poisson processes indexed by Z and such 
that the following properties hold. 

• 'i'cix) = '^'(x) for every x < z and ^^(a:) = "^{x) for every x > z. 

• {'^cix))x^z lias the same distribution as {'^{x))x^z but is independent from 

• {'i' Bix))xi^z has the same distribution as {'^{x))xi^z but is independent from 

Hence and ^'c are independent. Recall lemma [4.151 and definition 14.141 in 
section 14.61 and let 

Nc = snp{x + a+N*{x); x e C}, 

and 

Nb = inf{x - a_iV*(x); x e B}. 

Consider the event 

E {Nc <z< Nb}. 

On E, {{Fx{^))xec, (F,(*))xeB) is distributed like ((F,(*c))x6C, (F,(*B)).es)- 
Since '^c and 'i's are independent, {Fx{'if))x£C and {Fx{'if))x^B are independent 
on E. Hence the distance in total variation we want to estimate is at most P[ri\i?]. 

Note that n\E = {Nb < z} U {Nc ^ z}. As regards Nb, 

{Nb ^z}^ a-N*{x) ^ z}, 

xeB 

hence 

P[A^B ^ z] ^ F[x-a-N*{x) iS,z]. 

x^min B 

For each x ^ mini?, {x - a^N*{x) ^ z} = {N*{x) > K{n) + k/a-}, provided k 
is the nonnegative integer k := x — min_B. Furthermore, for every real number v, 
¥[N*{x) ^v]s^ m"^^ with m 'm{H, T) < 1. This yields 

^ ™K(n) — 1 

FWb ^z]^y m^M-^+k/a- ^ _ 

fc^O 

The same argument, applied to ¥[Nc ^ z], and the union bound, yield the result. 

The case when C = {x} and B = {y} with y > x is similar, except that one can 
replace the geometric series involved above by their first term, hence the tighter 
bounds. 

The proof of theorem [D] is complete. 
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4.8. Measurability properties. Let fi{H) denote the random counting measure 
associated to ^iJ, that is, 

Lemma 4.16. Fix sites x and y, a rule index i, an index k ^ 1, and a nonnegative 
measurable function : R — > R_|_. Then the random variable j Fdfi{H oa<s,(^x.i,k),y) 

is independent from J-^('i'{x,i,k)) and has the same distribution as J Fdii(H). 

Proof of lemma \4-l(>] The independence property is a consequence of the indepen- 
dence of cr^{x,i,k),y and J-^{'i'{x,i,k)). The equidistribution property is a conse- 
quence of the invariance of P with respect to <y^(x,i,k),y ^ 

Lemma 4.17. For every integer n ^ 0, the event {\E'(x,i,fc) e Zn{x)} is measur- 
able with respect to i, /c)). 

Proof of lemma \4.17\ We proceed by induction on n ^ 0. The case of 7i = is 
included in the definition of a couphng time with ambiguities. Let n ^ 1. By 
definition, {^(a;, i, k) G Zn{x)} is the union of the events 

E{x',i',k',y) := {^ix',i',k') e Z„_i(x), e H{y,^{x',i',k'))}, 

over every site x' , rule index i' , index k' and site y in x' + Aii . 

In the rest of this proof, we use r and r' as shorthands for t := \E'(x,i,A:) and 
r' :— ^'(x', i', fc'), respectively. 

Lemma [4. 171 follows from claims HTTSl and |4 . 1 91 below . 

Claim 4.18. For every event G in !F^{t'), G n {r < r'} belongs to J-^{t). 
Claim 4.19. The event {r e H{t' ,y)] belongs to T+{t). 

Indeed, by the induction hypothesis, {r' e Zn-i{x)} belongs to T^{t'). Hence, 
using claim [i7T51 we see that {r' g Z„_i(x)} fl {r < r'} belongs to JF+(t). On 
the other hand, by claim HTTOl we see that {r G H{t' ,y)} belongs to J-^{t) too. 
Since, by definition, {r G H{T',y)} C {r < r'}, we finally deduce that any event 
E{x', i',k', y) defined above belongs to J^+(r). This ends the proof of lemma BT71 

□ 

Proof of claim l4^TE[ It is easily checked, using the fact that {r < r'} belongs to 
J-"+(t), that the family of the events G which share the property stated in the claim 
is a (T-algebra. By definition, !F^{t') is generated by events of the form 

{Kr'{xi,ii) = ki] n {*(a;i,n,fc2) G B], 

for every site xi, rule index ii, and indices fci and k2 such that ^2 ^ fci, and every 
Borel subset B of M. 



Now for every event G of this form, it is easy to check that G fl {r < r'} belongs 
to JP'+(t). Claim mH follows. □ 
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Proof of claim \4^T3{ Observe that {r G Hy^r'} is the union over every 1 ^ m ^ A:— 1 
of the events 

{H{x y,i,k — m) o ar',y = 1} fl {Kt-'{x, i) — m} n {r < r'}. 

Fix TO ^ fc — 1. The collection of sets G in — y,i,k — m)) such that the 

event 

{1g o <Jr',y = 1} n {Kr'{x,i) = m} H {t < r'} 

belongs to is a cr-algebra. On the other hand, the a-algebra ^^(r) is gen- 

erated by the events of the form 

{Kr{xi,ii) = fci}n{*(xi,zi,fe) G B}, 

for every site xi, rule index zi, and integers fci and ^2 such that ^2 ^ fci, and every 
Borel subset i? of M. For every event G of this form, the definition of a coupling 
time with ambiguities yields the fact that the event 

{lGO(Jr',y = 1} n {Kr'ix,i) = m}n {r < r'} 

belongs to jr+(r). Claim [m follows. □ 

5. Applications to nucleotide substitution models 

Most stochastic models of nucleotidic substitution processes assume that the various 
sites along a DNA sequence evolve independently. However, it is a well-known 
experimental fact that the nucleotides in the immediate neighborhood of a site can 
affect drastically the substitution rates at this site. For instance, in the genomes 
of vertebrates, the increased rates of substitution of cytosine by thymine and of 
guanine by adenine in CpG dinucleotides are often quite noticeable (typical ratios 
10:1 when compared to the other rates of substitution). Recently, various models 
that take such dependences into account have been proposed, see [21 mO [71 [HI [El Hi] 
for instance. Among these, the class of RN-|-YpR models of nucleotide substitution, 
introduced by molecular biologists, and studied mathematically in [3], enjoys some 
remarkable properties, such as the possibility to solve exactly for several quantities 
of interest, and the occurrence of a non-zero but finite-range dependence along the 
sequence. 

Since these models put restrictive conditions on substitution rates (see below) that 
may be only approximately satisfied in some actual biological situations of interest, 
it is biologically relevant to study the properties of nucleotide substitution models 
that are not in the RN-|-YpR class but close to some models in this class. From 
a mathematical perspective, it is interesting to study what becomes of the depen- 
dencies along the sequence when small perturbations of the RN-|-YpR assumptions 
are introduced, thus destroying the special mechanism leading to finite-range de- 
pendence in the RN+YpR context. 

We apply the coupling techniques described in the rest of the paper to a generic 
family of perturbations of models in this class. In section [?7T1 we describe the class 
of models. In section 15.21 we introduce two coupling times with ambiguities and 
we state theorem [El our main result about these. The proof of theorem [E] is in 
section 15.31 In section 15. 4| we state some remarks. In section 15.51 we compute 
the growth parameters associated to these coupling times with ambiguities. The 
computations are based on some tree decompositions of conditional distributions. 
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stated in section 15.61 Finally, in section 15.71 we apply these results to the simplest 
non trivial example, namely the perturbed Jukes-Cantor model with CpG influence, 
thus proving the quantitative result stated as theorem [Fl 

We mention that the notations in this section are sometimes slightly at odds with 
those in the rest of the paper. 



5.1. Description of the models. 



5.1.1. RN+YpR models. Formally, these models are interacting particle systems 
with state space A^, where 

A {A,T, C, G} 

denotes the nucleotidic alphabet. The letters A and G correspond to purines, ab- 
breviated collectively by R, while C and T correspond to pyrimidines, abbreviated 
collectively by Y. Such a model is characterized by two sets of parameters, which 
describe two distinct kinds of transition mechanisms. 

The RN part of the model consists of a matrix S :— {sx,y)x,y£A of transition rates. 
The meaning of this matrix is that, when the state of a site is x, it is turned to y at 
rate Sx,y, independently of the other nucleotides. For S to be an RN matrix, some 
identities between coefficients must hold. Specifically, S must be of the following 
form: 
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The YpR part of the model is characterized by eight transition rates r|, indexed 
by all the couples {x,y) in ^ x ^ such that x and y are not both purines nor both 
pyrimidines. Thus the list of available YpR rates is 

ry^C „G ™A T" C „A G 

' Ai 'Ti 'C' 'G' 'G' 'T! 'C' ' a- 

To describe the meaning of these rates, we introduce the notations 

Y := {G, T}, 1a ■■= Y =: 1g, R := {A, G}, Ic ■= R =■ It, 

and 

A* := G, G* := A, G* := T, T* G. 

For every x in i? and every y in X^, if the state of a site is x and the state of its 
left neighbor is y, then the transition from x to x* occurs at rate r^. Similarly, for 
every xinY and every y in Ij, , if the state of a site is x and the state of its right 
neighbor is y, then the transition from x to x* occurs at rate r|. We refer to [3] for 
a thorough discussion of the properties of RN+YpR models. 

In the context of this paper, we use a specification of the dynamics by transition 
rules which is not the simplest possible one, mathematically speaking, but which 
enjoys coupling properties that are crucial in the sequel. We write a corresponding 
list of transition rules after another definition. 

Here is the list of transition rules. Recall that every rule TZ is of the form TZ = (c, r) 
for a rate r and a context c = {A, £, s). Accordingly, for every symbols a and f3, we 
write = (eg, rg) and eg = (Ag, sg). 
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• For every x in A, the rule TZ^ is defined by 

A^ := 0, £^ := 0, := x, := mm{v,,w,}. 

• For every x in the rule TZ^ is defined by 

A^ := {0}, C := := -i-, r}^ := (z;, - 

• For every x in A, the rule 7?.Jf is defined by 

Af := {0}, e := A\T,, := rf := K 



For every {x, y} = Y and every z in i?, the rule TZ 



Y 



IS 



,yz ■~ 

• For every {a;, y} = R and every 2; in Y, the rule T^-^^^j, is defined by 



■^xz,yz • {0)"!"-'-}) ^xz,yz • Vi '^xz,yz ■ ''y ^xz,yz ■ K-^' ^)}' 



■^zx,zy ■ { 1)0}) ^zx,zy ■ Vi '^zx,zy • '^y ^zx,zy ■ {(^)'^)}' 

In the following, the non-degeneracy condition (ND) holds: 

(ND) For every nucleotide x in A, Wx and Vx are positive. 

5.1.2. Perturbed RN+YpR models. We consider perturbations equivalent to the 
addition to the RN rules described above, of a generic matrix of substitution rates, 
that may not satisfy the RN property, and to the addition to the YpR rules, of 
generic neighbor-dependent transition rates, where the dependence is either to the 
right neighbor or to the left neighbor. 

Here is the list of perturbative transition rules. 

• For every distinct x and y in A, the rule TZ% y is 

• For every distinct x and y and every z in A, the rule TZ-lx zy 

"^zx.zy {^1; 0}, tzx,zy {(^) •^)}) ^zx.zy '■~ '^zx.zy ■~ ^{zx, Zy) . 

• For every distinct x and y and every z in A, the rule TZ%z,yz is 

-^xz,yz ■~ {0)+l}) ^xz,yz ■~ {(^)'^)}) ^xz,yz '■— Hi fxz,yz '■= s{xZ,yz). 

5.2. Two coupling times with ambiguities. 

Notation 5.1. For every site x and every subset K :=: {TZi ; i € J} of ^R with 
J C3, let 

^{x, K) := J) = IJ ^{x, i), ^{K) := M/( J), 
ieJ 

and 

r{K) :=^r,. 
ieJ 

By an abuse of notation, in the rest of the paper, we also use the shorthands 
'^{x,TZi) := ^{x,i) and "^{TZt) := for every i in 3. 
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Definition 5.2. Let 5+, 3o, 5-, 3+; 3'_ and Cp denote subsets of the rule set 9\ 
such that the sets 3+; 3o; 3- are not empty, and the sets 

3+n3V, 3-n3'_, 3on«p, 3+n<p, 3-n«p, 3'+n3'_n<p 

are all empty. Let 5 ■= (3+,3o,3-,3'+)3'_)- 

Let t-, to and t+ denote negative real numbers. Say that a coupling event based on 
3 occurs at site x — Q and at times {t-,t(y,t+) if the following holds. 

• t- <to and t+ < to. 

• t- belongs to "^{—1,5-), t+ belongs to "^{+1,5+) and to belongs to '^{0,5o)- 

• The sets *(-l,3'_)n]t_,to[ and *(+l,3+)n]t+,io[ are both empty. 

Let T3 denote the maximum of the times mm{t-,t+} such that a coupling event 
based on 3 occurs at times {t-,to,t+). Let denote the set 

Hf := (^-(-1, «P) U *(0, qj) U qj)) n [T3, 0[. 

Remark 5.3. When there exists at least one triple (t_,io,^+) which corresponds 
to a coupling event based on T3 is indeed a maximum since the set 

(l'(-l) U l'(O) U *(+l))n] min{t_, 0[ 

is finite. When there exists no triple {t-,to,t-^-) which corresponds to a coupling 
event based on "5, let T;^ := —00. 

We define two examples of coupUng events in the context of perturbed RN+YpR 
models, that we call YpR sensitive and YpR insensitive. 

Definition 5.4. YpR sensitive coupling events are based on the following choice of 
3 and <p. 

• 3o := {11^ ; ye A}. 

• 3- := {TZ^;y€R}U{TZ^;yeA}. 

• 3+ ■.= {n'^;yeY}U{n^;yeA}. 

• 3'_:= {1Zl,^y,;{x,y} = Y,zeR}. 
. 3'+:= {TZf,,,y;{x,y} = R,zeY}. 

• *P := {Ji-%x,zyi T^%z,yzi ^x,y i (-^J y , z) & A }. 

Definition 5.5. YpR insensitive coupling events are based on the following choice 
of 5 and 

• 3o := ; y G A}. 

• 3- := {n^,n^;yeR}. 
. 3+ := «, < ; y e Y}. 

• 3— • {^1^ I I ^x.y^ '^xZjyzJ '^zx,zy i ^ A, y E Y, Z A}. 

• 3+ •= {^y J ^y J '^x,yJ ^%z,yz^ '^zx,zy ' ^ .A, y (z R, Z & A}. 

• ^ := T^%z,yz' "^zxjzy ' {^^Vi^) ^ .A^}- 

Notation 5.6. We write T^en and -ffgen, respectively Ting anrf .ffins; and 
associated to a YpR sensitive, respectively YpR insensitive, coupling event. 

Here is our main result about these random variables. 
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Theorem E. The random variable (Tscn, Hgcn) defines a coupling time with ambi- 
guities, whose width is bounded by a+ = a- = 2, and such that Tson is exponentially 
integrable. The same assertions hold for (Tms, H-ms)- 

Remark 5.7. Due to the non- degeneracy assumption (ND), and to standard in- 
dependence properties of Poisson processes, Tgon o,nd Tins almost surely finite 
negative random variables. Properties and Q) of definition \2.10\ are also 

easy to establish, so the real issue is to prove property 0) and the exponential 
integrability. 

5.3. Proof of the main result. This section is devoted to the proof of theoremlEl 

5.3.1. Preliminary result. 

Notation 5.8. Let g denote the application which fuses the two purines together, 
and T] the application which fuses the two pyrimidines together, that is 

g{A):^R^:^g{G), g{C) -.^ C, g{T):^T, 

and 

V{A):^A, v{G):^G, v{C) := Y =: rj{T). 
For every times s < t, every configuration let 

0o(e, s, t, *) s, t, -1)), s, t, 0),vmt s, t, +1))) . 

Lemma 5.9. Assume that Perf t, iJscn) = Perf(^', i, i/gcn) for a time t < Tsen 
and for some configurations ^ and ^' . Then, P almost surely. 

The same statement holds if one replaces Tgcn o,nd -ffgon by Tins and Hi^g, respec- 
tively. 

From now on, we assume that Cli r\fl2 (~\ {Tgcn > — oo} H {Tins > — cxo} holds, since 
this event has full probability. 

5.3.2. Proof of the preliminary result for YpR sensitive coupling events. We study 
what happens if one starts at a time t < Tson from two initial configurations ^ and 

such that Perf(C, t, i/sc„) = Perf(e', t, i/so„). 

Let ti < ■ ■ ■ < tr denote an ordering of ^'(— 1) n [<_, to[. 

Claim 5.10. With full probability, for every t < Tson; for every index 1 ^ k ^ r, 
e($(e,t,ife,-l)) = e($(e',t,ife,-l)). 

If claim fS.lOl holds. g{^{£,,t,tQ , —1)) = gi(<i>(^', t, , — 1)), and this fact implies that 

e($(^,t,io,-i)) = e($(?',t,io,-i)). 

A symmetric argument shows that ri{^{£^,t,tQ, +1)) ~ 77(<i>(^', i, to, +1)). 

Finally, at site and time to, the definition of Tson implies that there is a rule of 
the form TZy , hence $(^,i,to,0) = ^{^',t,to,0) and lemma ISJl holds for Tgen- 



COUPLING TIMES WITH AMBIGUITIES FOR PARTICLE SYSTEMS 



25 



Proof of claim lS^TUi Induction on k. Start with ti = t-. By definition of Tscn, at 
site —1 and time t-, there is a point corresponding to a rule among TZ^ and TZ^ 
for any x and any purine y. Each rule TZ^ yields a nucleotide x for both initial 
conditions ^ and As regards the rules TZ^ and TZq, either there is a purine at site 
— 1 and time {t-)^ , in which case the rule is not performed, or there is a pyrimidine 
and the rule is performed. In both cases, there is a purine at site —1 and time i_. 
This proves that g($(^, <, i_ , — 1)) = ^?($(^', t, i_, — 1)), hence the claim holds for 
k = l. 

Now we assume that the claim holds for fc — 1 with k ^ r, hence t, i^T, —1)) = 

g{^{C ,t,t'^ , —I)) and we consider the effect of the rule apphed at time tk- Call 
this rule TZ. Several cases arise. 

• If 7^ is perturbative, Perf t, iJson) — Perf (^', i, i/scn), hence TZ is per- 
formed for both initial conditions ^ and or for none. In both cases, 

• The same reasoning holds if TZ is non-perturbative and of the form TZy . 

• If TZ ^ TZy, since g{^{^,t,t^ , -1)) = g{^{^' ,t,t^ , -1)) by assumption, 
$(^,i,t^,— 1) and $(^', t, i^, — 1) are both purines or both pyrimidines. 
Hence, TZ is performed for both configurations ^ and ^' or for none, and 
gm,t,tk,-l)) = gm\t,tk,-l)). 

• li TZ = TZ^ for a purine y, the application of TZ leaves g unchanged, since 
TZ can only turn an A to a G or vice-versa. 

• If 7^ = 7^f for a pyrimidine y, g^^, t, , -1)) = g{<^<,£.' , i, t^, -1)), hence 
TZ is performed for both ^ and or for none. 

• If 7^ = TZ^^ ^y, the application of TZ leaves g unchanged, since TZ can only 
turn an A to a G or vice-versa. 

• Finally, the definition of Tscn excludes the rules TZYz yz ■ 

This proves claim [STTOl □ 

5.3.3. Proof of the preliminary result for YpR insensitive coupling events. As in 
the proof for Tgen, let < • • • < denote an ordering of ^'(— 1) fl io[- 

Claim 5.11. With full probability, for every time t < Tins o,nd index 1 ^ k ^ r, 

gmt t, tk, -1)) = gmC: t, tk, -1)) = R. 

If claim fS.lll holds. gi(<i>(^, i, , — 1)) = £»(<i>(^', t, , — 1)), and this fact implies that 

gmtt,to,-l)) ^ gmc,t,to,-i))- 

A symmetric argument shows that r]{^{^, t, to, +1)) = , t, to, +!))■ 

Finally, at site and time to, the definition of Tins implies that there is a rule of the 
form TZy , hence $(^,t, io,0) = ^{£_' ,t,to,0), and lemma EH holds for Tins as weU. 

Proof of claim lKTl] Induction on fc. Start with ti = By the definition of Tins, 
at site —1 and time t_, there is a point corresponding to a rule TZy or TZy with a 
purine y. Hence g{^{^,t, i_, — 1)) = g{^{^', t,t^,—l)) is a purine. 

Let fc ^ r and assume that the claim holds for fc — 1, hence t, — 1)) 

and g{^{C,t,tk-i,~l)) coincide. Thus, g{<^{^,t,t^ , -1)) and i^, -1)) 
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coincide. Consider the effect of the rule apphed at time tk- Cah this rule TZ. 
Several cases arise. 

• Assume first that TZ is pcrturbative. If TZ is performed, TZ leads to a purine 
because, by the definition of Tins, every perturbative rule applied at times 
ti, . . . , i„, leads to a purine. If TZ is not performed, the induction hypothesis 
shows that we must as well have a purine. As a consequence, whether TZ is 
performed or not, t, tk, —1)) = Qi^{£,' , t, tk, —1)) is a purine. 

• The same conclusion holds if 7^ TZy or TZ^ , because in this case, by the 
definition of Tins, y must be a purine. 

• li TZ = TZ^ for a pyrimidine y, the induction hypothesis implies that TZ is 
not performed. 

• If 7^ = TZ^ for a purine y, whether TZ is performed or not does not affect 
the value of g. 

• li TZ — TZf^_^y, performing TZ has no effect on the value of g, since TZ can 
only turn an to a G or vice-versa. 

• Rules TZ^^ are not performed since i^, —1) and $((^',t, i^, —1) are 
both purines, by the induction hypothesis. 

This proves claim [57TT] □ 

5.3.4. Application to theorem\^ Let T denote Tson or Tins. Let ii < ■ • ■ < t,. denote 
an ordering of the set U *(0) U *(+l))n]io, 0[. 

Claim 5.12. With full probability, for every time t < T and index 1 ^ fc ^ r, 

MLt,tk,'^) = Md,tk,'i')- 

Claim [5J2l shows that property ([5]) in definition 12.101 holds. The boundedness of 
the width is then straightforward. 

Proof of claim [57711 Induction on k. For k — 1, this is lemma [579] Let k ^ r, 
assume that the claim holds for tk-i and call TZ the rule applied at time tk- Let Xk 
denote the corresponding site, hence tk is in \E'(xfc). 

• To begin with, if TZ is perturbative, TZ is performed for both initial condi- 
tions ^ and or for none, hence the claim holds for tk- 

• The same is true if 7^ = TZy for a given y. 

• If 7^ = TZ]^ for a given y, (?!)o(C, t, ifc-i, *) = (f>o{£.' ,t,tk~i,'^), hence, for 
X = —1, or -fl, <I>(^,t,t^,a;) and <I>(^', f^, a;) are both purines or both 
pyrimidines. This means that TZ is performed for both ^ and S,' or for none, 
and the claim holds for tk- 

• If TZ = TZ^ for a given y, several subcases arise. 

— If Xfe = — 1 and y is a purine, performing TZ has no effect on the value 
of g, since TZ can only turn an A to a G or vice- versa. 

— If Xfc = — 1 and y is a pyrimidine, TZ is performed for both ^ and ^' or 
for none, because g{^{£,, t, t^ , —1)) = g{^{C t, t^ , —1)). 

— If a^fe = +1, symmetric arguments hold. 

— If a;fe = 0, <i>(^, i, t^, -1) = $(^', t, t^, -1), hence TZ is performed for 
both ^ and or for none. 

This concludes the case when TZ — TZ^ for a given y. 
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• If 7?. is a rule TZ^^ and Xk = —1, performing TZ has no effect on tlie value 
of Q, since it can only turn an A to a. G or vice- versa. 

• If 7^ is a rule TZ^^y^ and Xk — —1, the fact that £)(<I>(^, i, t^, — 1)) and 

, -1)) are equal and the fact that $(^,t,i^,0) and <i>(^', t,t^ ,0) 
are equal ensures that TZ is performed for both ^ and or for none. 

• If 7^ is a rule TZf^^^y or a rule TZ^^ y^, and Xk = +1, similar arguments hold. 

• If Xfe = 0, the facts that — 1)) and g{^{^' ,t,t^ , —1)) are equal, 
that <I>(^, t, ,0) and $(^', t, ,0) are equal for a rule of the form TZf^^y, 
and the facts that r/($(^, t, i^T, +1)) and r/($(^', i^T, +1)) are equal and 
that <i>(^, i, tjT, 0) and , 0) are equal for a rule of the form TZ^^ y^ 
ensure that TZ is performed for both ^ and , or for none. 

This concludes the proof of claim [5TT21 □ 



5.3.5. End of the proof of theorem\^ To conclude the proof of theorem|El one must 
show that Tscn and Tins are both exponentially integrable. The proof is the same 
in both cases. 

We define inductively decreasing sequences of random times ([/„)„^o, (C^n )n>o, 
(C/+)„^o and (C/°)n5.o- Let C/q := 0. For every let 

f7,7 := max*(-l,3_)n] - oo,;7„[, [/+ := max 3+)n] - cx),C/„[, 

and 

C/O := min{C/-, [/+}, U^+i := max*(0, 3o)n] - oo, C/°[. 
Hence Un+i < U^i ^ , < Un and every J7„ is almost surely finite. 
For every n ^ 0, consider the event 

Sn {*(-i,3'-)n]c/„-,c/„h*(+i,3V)n];7+,c/„[=0}. 

For every n ^ 1, on the event 5„, the choice ~ C/„, t_ = [/^, = t/+ yields 
an admissible triple (t_,to,t+), hence r3 C/n+i- Let T := ?77v+i where N is the 
first integer n ^ 1 such that S'„ holds. We wish to show that T is exponentially 
integrable. 

The sequence (5"^, Un — Un+i)n^o is i.i.d. and, for every n ^ 1, 

{iv = n} = s'jn---ns'^_ins'„. 

Hence, for every real number A, 

At(A) = E[c-^^"+i ; TV = ?i] = E[c-^^i]E[e-^^i ; ^o] M'^'^^' ; 

n^l n^l 

This shows that At (A) is finite if and only if E[e~^'^i] is finite and E[e-^^i ; -Sg] < 1 
(but we recall that T is not T-i,). 

By construction, Ui ^ Uq + Uq + (C/i — Uq) and these three random variables are 
independent and exponential with parameters r(3-), ''(3+) and r(3o), respectively, 
hence E[e~'^'^^] is finite for every A smaller than these three rates. Since Uq ^ Ui, 
the same conclusion applies to Uq. 

Furthermore, f/g — Ui is independent of (Sq^Uq) and its distribution is exponen- 
tial of parameter r(3o)- Conditionally on Uq and J7°, the number of points in 
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the sets 3-)n]C/o , 0[ and 5+)n]UQ ,0[ arc independent and Poisson of 

parameters —r{y_)Uf^ and —r{3+)UQ . For every A < r(3o), this yields 

Hence the fact that Ay (A) is finite for some positive A follows from the claim below, 
with ro := r(3o), W := 1 - c^'^^'-'I'^o +^0'M and V := -U^. 

Claim 5.13. Let V and W denote positive random variables and tq a positive real 
number. Assume that V is exponentially integrahle and that W < 1 almost surely. 
For every real number X < r^, let 

F{X) _!^E[iye^^]. 

To - A 

Then there exists some positive values of X such that F{X) < 1. 

Proof of claim \5.13[ Expansions of the exponentials at order 1 with respect to the 
parameter A yield F(0) = E[W] < 1, and F'(0) = E[WV] + r^^E[W]. Hence F'(0) 
is finite and the proof of the claim is complete. □ 

5.4. Remarks. First, in our two examples, properties ([1]), ([2]), ([3]) and ^ in defini- 
tion [Tini still hold if one removes the points in '5(0, *P)n]T3, io[ from the definition 
of (here {t^,to,t^) denotes a triple such that there exists a coupling event 
based on 3 at time {t-,tQ,t+) and such that min{i_,t+} is maximal among such 
coupling events). This leads to a smaller growth parameter and Laplace transform, 
but, on the other hand, property (|4]) (the stopping property of H?) is lost, and we 
need property (U) to prove effective estimates on the ambiguity process. 

Second, as regards YpR sensitive coupling events, Tson is defined purely in terms 
of non-perturbative rules, and, in fact, the proof of theorem |E] implies that Tgon 
is an ordinary coupling time for the unperturbed dynamics. In other words, the 
content of theorem [E] is that the fact of fixing the ambiguities associated to some 
perturbative rules in 'E'njTscn, 0[ restores the coupling property of Tson- 

Third, the situation is a bit different for YpR insensitive coupling events, whose 
definition involves both perturbative and non-perturbative rules. Indeed, remov- 
ing the perturbative rules from the definitions of 3- and 3+ for YpR insensitive 
coupling events would make Tins a coupling time for the unperturbed dynamics. 
However, fixing ambiguities associated to perturbative rules in 'I'n]Tins,0[ is not 
enough to restore the coupling property of Tins- This is the reason why, in this 
example, one must introduce perturbative rules in the definition of Tins • 

Finally, note that it is possible to use our two examples of coupling times with 
ambiguities to perform perfect simulation according to the Propp- Wilson method 
(see [II]). Indeed, by the definition of a coupling time with ambiguities, for any 
finite subset i? C Z, the random times are coalescence times that allow to 
sample exactly from the projection ttb of the invariant distribution of the particle 
system onto the sites in B. Moreover, from their definition, our two examples allow 
for an efficient detection of coalescence by an algorithm. 
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5.5. Computations of growth parameters. We recall notation l5.1l in section lSTSl 

and we define inductively sequences (A„)„^o, (k„)„^i and (xn)n^i of times and a 
sequence {Pn)n^i of bits, as follows. Let Ao :~ 0. For every integer n ^ 1, let 

Kn ■= sup *(0, 3o)n] - oo, A„_i [, 

and 

Xn :=sup(*(+i,3+u3'+)u*(-i,3-u3-))n] -c3o,K„[. 

We define A„ and /3„ as follows. 

• If Xn is in ^-(+1,3+), let A„ sup ^(-l, 3- U 3'_)n] - oo, k„[; then, if A„ 
is in \E'(— 1,3-), let /3„ := 1, otherwise let /3„ := 0. 

• If Xn is in *(-l,3-), let A„ := sup 3+ U 3V)'~l] -oo,k„[; then, if A„ 
is in \E'(+1,3+), let /?„ := 1, otherwise let /3„ := 0. 

• If Xn is in *(+l,3'+) or in ^'(-1,3-), let A„ := Xn and /3„ := 0. 

For every rule 7?, in *p and n ^ 1, let 

iv„(7^) := 7^) u *(o, TZ) u 7^)) n [a„, a„_i [. 

By the independence properties of Poisson processes, for every rule TZ in *p, the 
sequence {(3n, Nn{TZ))n^i is i.i.d. Let 

G(3) min{n ^ 1 ; /3„ = 1}. 

The distribution of 0(3) is geometric on {1,2,...} and non degenerate under our 
non-degeneracy assumption that every is positive. 

The proof of the following lemma is an easy consequence of the definitions above 
and is omitted. 

Lemma 5.14. P[r3 = Ag(3)] = 1- 

As a consequence, with probability one, for every rule 72. in *p, 

G(3) 

ri=l 

so that 

G(3) 

n=l 

and 

E[#Hf n *(7e)] = p[/3i = i]-iE[iVi(7e)]. 

The proposition below follows. 
Proposition 5.15. The mean of {T^, H^) is 

Section [STBl describes a tree of successive conditional distributions leading, for every 
rule TZ in to an explicit representation of the joint distribution of Ni(TZ) and 
/3i, and to the computation of E[A^i(7^)] and P[/3i = 1] in terms of rates. 
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Notation 5.16. Introduce the shorthands ro := r(3o), 

ri := r(3+), r2 := ?'(3+), f3 ^'(3+ U 3'+) = t"! + ^2, 

r4 := r(3-), rg r(3'_), := r(3- U 3'-) = R + rg. 
Section 15.61 below leads to the following values. 

• For every TZ ~ (c, r) in *P and not in 3+ U 3'_, 

E[Ni{TZ)] ^3r(- + ^1 + ^1 + !^ 

\ro ra + re \ re rs 

• For every TZ ~ (c, r) in *p and in 3+ U 3- , 

E[Ni{n)]^r(- + ^—(2 + 3- + 3- 

\ra rs + re V rs 

The fact that every E[A^i(7?.)] is bounded by the expression for the case when TZ is 
not in 3+ U 3'_ yields the following result. 



Proposition 5.17. One has m{T^, Hf) 3M{^) ^ r,4Ai, where 

M(3) + + ^ + ^ 

rir4 Vro r^ + re \ re r^ 

5.6. Tree decompositions of conditional distributions. In this section, we 
describe the distributions of a family of random variables by placing each of them 
at the vertices of a tree. The distribution of the random variable placed at vertex 
X. is conditional to the random variables placed at vertices which are ancestors of 
X. in the tree. We use the following labelling: 1. is the root of the tree, and the 
children of a vertex x. are x.y. with y ~ I, y — 2, and so on. 

To avoid cumbersome notations, we denote by (n). a piece of "trunk" of length 
n, that is, a vertex 1.1.- •■ .1. with n ones such that no ramification starts before 
it. For instance, 1.1.1. =(3)., 1. 1.1. 1.1.1. =(5). 1. and 1.1. 1.1. 1.2. =(5). 2. Finally, we 
recall the shorthands of notation 15.11 in section 15.21 



Three distinct situations arise: the rule TZ belongs to 3+ or to 3- or to none of 
these two sets. 

5.6.1. First case: perturbative rules not in 3+U3'- . Assume that the rule TZ — (c, r) 
is in *p and not in 3+ U 3'_ • 

Vertex 1. The distribution of — ki is exponential with parameter ro. 

Vertex (2) . The distribution of # (* (- 1 , 7e) U * (0, 7e) U (+ 1 , 7e) ) n] ki , [ is Poisson 
with parameter 3r (— 

Vertex (3). The distribution of ki — xi is exponential with parameter r^ + re- 

Vertex (4). The distribution of 1, 7e) U ^-(0, 7^) U *(+l, 7^))n]xl, ki[ is Pois- 

son with parameter 3r (ki — Xi)- 
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Vertex (5). The probability that xi belongs to \E'(+1,3+ U 3+) is r^/ir^ + re), 
hence the probability that xi belongs to ^'(— 1,3- U3'_) is r6/{r3 + r6). 

Vertex (5).l. If xi belongs to ^'(+1, 3+ U3+), then the probability that xi belongs 
to ^'(+1, 3+) is Ti/rs, hence the probability that xi belongs to ^'(+1, 3+) is T-i.jro,- 

Vertex (5). 1.1. If xi belongs to ^(+1,3+), then /3i = and Ai = xi- 

Vertex (5). 1.2. If xi belongs to \E'(+1,3+), then the distribution of xi — Ai is 
exponential with parameter r%. 

Vertex (5). 1.2.1. The distribution of #(*(-l, Tl) U *(0, 7^) U *(+l, 7^))n]Al, xi [ is 
Poisson with parameter 3r (xi — Ai). 

Vertex (5). 1.2. 1.1. The probability that /3i = 1 is ri/r^, hence the probability that 
/3i = is rs/re- 

Vertex (5). 2. If xi belongs to ^'(— 1,3-U5'_), then the probability that xi belongs 
to ^'(— 1,3-) is Ti/rQ, hence the probability that xi belongs to ^'(— 1,3'_) is r^/r^. 

Vertex (5). 2.1. If xi belongs to then /3i = and Ai = xi- 

Vertex (5). 2. 2. If xi belongs to \E'(— 1,3-), then the distribution of xi — Ai is 
exponential with parameter ra. 

Vertex (5).2.2.1. The distribution of #{^{-1,11) U *(0, 7^) U ^(+1, 7^))n]Al, xi [ is 
Poisson with parameter 3r (xi — Ai). 

Vertex (5). 2. 2. 1.1. The probability that /3i = 1 is ri/rs, hence the probability that 

/3i = is r2/r3. 

5.6.2. Second case: perturbative rules in 3+- Assume that the rule TZ = (c, r) is in 

qjn3'+- 

Vertex 1. The distribution of —ni is exponential with parameter ro- 

Vertex (2). The distribution of #(*(-l, 7^)U*(0, 7^)U^'(+l, 7^))n]K;l, 0[ is Poisson 
with parameter 3r (— ki). 

Vertex (3). The distribution of ki — xi is exponential with parameter r^+rQ. 

Vertex (4). The distribution of #(*(-l, 7^) U *(0, 7e) U*(+l, 7^))n]xl, «;i[ is Pois- 
son with parameter r {ki — xi)- 

Vertex (5). The probability that xi belongs to ^'(+1,3+ U 3+) is 'r^/{rz + re), 
hence the probability that xi belongs to ^'(— 1, 3- U 3'_) is re/(r3 + re). 

Vertex (5).l. If xi belongs to ^'(+1, 3+ U3+), then the probability that xi belongs 
to ^'(+1,3+) is ri/ra, hence the probability that xi belongs to ^'(+1,3+) is r2/r3. 

Vertex (5). 1.1. If xi belongs to 'I'(+l,3'^), then /3i = and Ai = Xi, and the 
probability that xi belongs to ^'(+1,7^) is r/r2. 

Vertex (5). 1.2. If xi belongs to \E'(+1,3+), then the distribution of xi — M is 
exponential with parameter re- 

Vertex (5). 1.2.1. The distribution of #(*(-l, 7e) U *(0, 7^) U 'J'(+l, 7^))n]Al, Xi[ is 
Poisson with parameter 3r (xi — Ai). 

Vertex (5). 1.2. 1.1. The probability that /3i = 1 is r^/rQ, hence the probability that 
/3i = is rs/re. 
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Vertex (5). 2. If xi belongs to 1,3- U3'_), the probability that xi belongs to 
^'(— 1,3-) is r^/re, hence the probability that xi belongs to 1,3'_) is r^/re- 

Vertex (5).2.1. If xi belongs to *(-l,3'_), then /3i = and Ai = xi- 

Vertex (5). 2. 2. If xi belongs to ^'(— 1,3-), then the distribution of xi — Ai is 

exponential with parameter r^. 

Vertex (5). 2.2.1. The distribution of #(*(-l, 7^) U *(0, TZ) U *(+l, 7e))n]Ai, xi[ is 
Poisson with parameter 2r (xi — Ai). 

Vertex (5). 2. 2. 1.1. The probability that /3i = 1 is ri/r^, hence the probability that 
/3i = is r2/r3. If /?i = 0, the probability that xi belongs to \E'(+1,7?.) is r/r2- 

5.6.3. Third case: perturbative rules in 3'_. One can deduce this case from the 
second similar algorithm holds, obtained through the transformations 3- 

3+ and 3- ^ 3+ in the steps of the algorithm for rules in 3+- 

5.7. Example: perturbed Jukes-Ccintor model with CpG influence. We 

now apply this upper bound to perturbations of Jukes-Cantor model of evolution 
with influence of the dimiclcotidc CpG. Namely, we assume that CpG mutates to 
CpA and to TpG, both at rates 5, that every nucleotide x mutates to y ^ a; at 
rate 1 + e{x, y) with e{x, y) ^ 0. Let \e\ denote the sum over every x and y of the 
perturbations e{x,y). 

As regards sensitive coupling events, ro = ri = r4 = 4 and r2 = ^5 = 5. The mean 
of the coupling mechanism based on (Tgen, -f^sen) is bounded by ^|£| (40 + 10(5 + 5^), 
hence this coupling mechanism is subcritical as soon as |e| < £sen(^), with 

IK\ 64 
£se„iOJ - 3(4o+io^ + j2-)- 

Hence, £sen(0) = ^, esen(2) = |, £sen(10) = ^, and £sen(^) ^ when 5 +00. 

As regards insensitive coupling events, ro = 4, ri = r4 = 2, r2 is the sum of the 
modifications e{x, y) for y in R, and rs is the sum of the modifications e{x, y) for 
y in Y. The mean of the coupling mechanism based on (Tins, ^^ins) is bounded 
by ^|£|(64 + 12|e| + |ep), hence this coupling mechanism is subcritical as soon as 
|e| < £ins) where £ins is the unique positive root of 

3£i„s(64 + 12ei„s + eL) = 64. 

For every value of 5, this insensitive coupling mechanism is subcritical as soon as 

l£l < 

Theorem F. Every perturbation by rates e of the Jukes-Cantor model with CpG 
influence of magnitude 5 is ergodic as soon as \e\ < max{£ins,£sen('^)}- Further- 
more, every finite marginal converges exponentially fast to the corresponding finite 
marginal of the stationary distribution, and the correlations of the stationary dis- 
tribution decay exponentially fast. 
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